Exploratory analyses

Read counts per sample and genes to explore the quality of libraries

Read distribution (Raw data)

Violin plot indicates similar expression across samples but many genes need to be filtered due to low count values.

Number of expressed genes

Most samples express similar number of genes with at least 5 reads mapped to the gene.

Libraray size (reletive sequence depth)

Most sample have similar library size and it does not seem to relate with the sampling year. There is however more variation on Day 1.

Variation in sequencing depth can potentially introduce error to the model even if it is corrected for in the models. Consistent sample prep and equal amount of RNA maybe helpful in keeping it somewhat similar.

Read distribution (Filtered Data (day/year))

Read count distribution colored by day of sample prep

Read count distribution colored sample year

Sample Distance

Sample distance estimate does not suggest relationship between the age of the samples or the day of preparation. There maybe however larger variation in a larger study.

Sample Distance

PCA colored for Day of preparation and sampling year.

PCA by day PCA by day